Learning Temporal Sequences using Bag of Features
ثبت نشده
چکیده
This paper presents a novel method for learning classes of temporal sequences using a bag-of-features approach. Bag-of-features techniques have recently been introduced with success in the computer vision community for image classification, object recognition and categorization. We transpose these techniques from the image domain to the temporal domain. More exactly, we show that such techniques can be applied to the case where inputs are temporal sequences of features representing events to classify, and therefore for tasks such as gesture and activity recognition. In our approach, a codebook of local of temporal patches, or codewords, is automatically constructed from a set of sample sequences. Temporal sequences are then encoded using accumulated histograms of parts from this dictionary. Based on this representation, a multi-class classifier, treating the bag of features as the feature vector, is applied to estimate which class to assign to the temporal sequence. Obviously, this type of approach would not be able to explicitly learn the temporal relationships between the features in the sequence. However, as our study shows, the use of overlapping descriptors evaluated at multiple temporal scales are able to implicitly learn these temporal relationships. The segmentation problem is also addressed. For each learned temporal patch, a temporal model of the sequence boundaries is learned and used to simultaneously segment and classify the temporal sequences. Finally, extensive experiments are performed on three different datasets to compare our method against state-ofthe-art algorithms, including Hidden Markov Models, Conditional Random Fields and Support Vector Machine. The results show that our algorithm performs better and requires less training data than competing algorithms.
منابع مشابه
Learning Temporal Sequences using Bags of Features
This paper presents a novel method for learning classes of temporal sequences using a bag-of-features approach. We define a temporal sequence as a bag of temporal features and show how this representation can be used for the recognition and segmentation of temporal events. A codebook of temporal descriptors, representing the local temporal texture, is automatically constructed from a set of sam...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کاملRecognizing Conversational Interaction Based on 3D Human Pose
In this paper, we take a bag of visual words approach to investigate whether it is possible to distinguish conversational scenarios from observing human motion alone, in particular gestures in 3D. The conversational interactions concerned in this work have rather subtle differences among them. Unlike typical action or event recognition, each interaction in our case contain many instances of pri...
متن کاملKYOTO at the NTCIR-12 Temporalia Task: Machine Learning Approach for Temporal Intent Disambiguation Subtask
This paper describes the Kyoto system for Temporal Intent Disambiguation (TID) subtask in the NTCIR-12 Temporal Information Access (Temporalia-2) challenge. The task is to estimate the distribution of temporal intents (Past, Recency, Future, Atemporal) of a given query. We took a supervised machine learning approach, using features of bag of words, POS and word vectors. We also incorporated kno...
متن کاملAction Recognition using Temporal Bag-of-Words from Depth Maps
In this paper, we present a methodology for human action recognition from a sequence of depth maps obtained using Microsoft Kinect. Specifically, we use a Temporal Bag-of-Words model as representation scheme to capture the variation of features across the temporal domain. Our methodology builds the Temporal Bag-of-Words model on top of the spatiotemporal features extracted from interest points....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006